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Abstract — We apply a dynamical systems approach to con- 
catenation of quantum error correcting codes, extending and 
generalizing the results of Rahn et al. [1] to both diagonal and 
non-diagonal channels. Our point of view is global: instead of 
focusing on particular types of noise channels, we study the 
geometry of the coding map as a discrete-time dynamical system 
on the entire space of noise channels. 

In the case of diagonal channels, we show that any code with 
distance at least three corrects (in the infinite concatenation limit) 
an open set of errors. For Calderbank-Shor-Steane (CSS) codes, 
we give a more precise characterization of that set. We show how 
to incorporate noise in the gates, thus completing the framework. 
We derive some general bounds for noise channels, which allows 
us to analyze several codes in detail. 

Index Terms — Quantum error correction, quantum channels, 
quantum fault tolerance 



I. Introduction 

In this paper we analyze quantum codes in essence, ab- 
stracting their details as codes and extracting their fault 
tolerance properties using a dynamical systems approach. This 
framework has been initiated by Rahn et al. [1]. They show 
how to incorporate diagonal noise on the qubit into an effective 
channel on the logical qubits. 

We broaden this viewpoint and extend their approach in sev- 
eral ways. We look at the effective channel from a dynamical 
systems point of view, using tools and methods from this field. 
In particular we characterize the region of correctable errors 
using tools from the analysis of fixed points and show how to 
incorporate perturbations of the coding map. 

Our second chain of results extends the results of [1] to the 
realistic model of faulty gates and general channels. Rahn et al. 
only analyzed the depolarizing channel on the physical qubits 
as the single source of noise. We show that incorporating 
noisy gates gives rise to a perturbed effective channel. We also 
analyze general noise on the qubits and give several bounds 
for the convergence of non-diagonal channels to diagonal 
channels. Our results are supported by several examples for 
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the family of CSS-codes, which is the encoding predominantly 
proposed for fault-tolerant quantum computing. We simplify 
our bounds in the case of CSS codes and analyze the [[7, 1, 3]] 
code, the smallest member of the CSS family, in great detail. 

a) Structure of the paper: We first introduce the dynam- 
ical systems approach in Section UTI and establish the notation 
and some basics. In Section we extend this approach 
to diagonal channels, including an analysis of regions of 
convergence. Section HV1 deals with faulty gates. In Section IV1 
we establish several results and examples for non-diagonal (i.e. 
general) noise channels and in Section lvTl we discuss a way to 
improve channels. Our approach allows to drastically reduce 
the number of parameters, lending Quantum error correcting 
codes to an elegant analysis. This however comes at some 
price, and in Section lvTll we outline some of the shortcomings 
of this approach, before concluding with some open questions. 

II. Notation and Framework 

In this section we formulate the basic framework and 
review the main results from [1], which should be consulted 
for details. Quantum states are represented by their density 
matrices. 

The error correction process consists of three parts: encod- 
ing £i, noise JV , and decoding Each part is modeled as 
a quantum channel, namely, a map taking density matrices to 
density matrices. Quantum channels are required to be linear, 
trace-preserving, and completely positive, hence of the form 



^A jP A], with 



I. 



(1) 



where Aj are linear operators and / is the identity (cf. [2]). 
In addition, we will assume that the channels are time- 
independent in order to simplify the study of their con- 
vergence. In the subsequent sections, we will often denote 
quantum channels by $. 

Encoding § takes an initial logical qubit state po to the 
initial register state p(0) which evolves according to some 
continuous-time noise dynamics. We consider the evolution 
for a fixed amount of time f, turning noise into a discrete- 
time operation jV which takes p(Q) into a final register state 
p{t) = jV(p(<S)). Finally, decoding 9 takes p(t) to the final 
logical qubit state pf. The map 

Sf = SJo.yV og : p -> p f 
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describes the effective dynamics of the encoded information 
resulting from the physical dynamics of JV and is called the 
effective channel. 

We consider noise models JV on n qubits consisting of 
uncorrected noise ,yV^ on each single physical qubit, so 

n times 



Given an n qubit quantum error correcting code C with 
encoding operation S and decoding operation 9l, the map 
taking the single qubit noise .jV^ to the effective channel 



o<f, 



(2) 



is called the coding map of C. 

The density matrix of one qubit can be expanded in the 
standard Pauli basis 9* — {I, X, Y, Z} for density matrices 
and represented as a four-dimensional real vector. A noise 
channel jY^' can then be represented as a 4 x 4 matrix 



/ 1 








\ 


Nxi 


Nxx 


Nxy 


Nxz 


Nyi 


Nyx 


Nyy 


Nyz 


\N ZI 


N Z x 


Nzy 


N ZZ J 



Zeroes in the first row are due to trace preservation, 
an arbitrary n qubit code C, the entries of the matrix 5 
Q C (^Y^) can be calculated to be 



(3) 



For 



(4) 



where fj, = (/i l7 



- Mi 



,), v = (fi, . . . , v n ) run over 9>® n 7 and 



a£ , f}° are the coefficients in the expansions for the encoding 
and decoding operations relative to 9>® n . See [1] for details. 

If the matrix Q is diagonal, JY^ 1 ' is called a diagonal 
channel. In that case, we write x — Nxx, y = Nyy, and 
z = Nzz and denote the channel by [x, y, z]. It was shown in 
[3] that complete positivity of such channels implies that the 
point (x, y, z) must be in the tetrahedron A defined by 



y + z < 1 

y + z < 1 
y — z < 1 
-x — y — z < 1. 



-x ■ 
x ■ 
x ■ 



(5) 



It is easily checked that a single-bit Pauli channel with 
exclusive probabilities < Px,Py,Pz < 1> 

p -> (1 - p x - p Y - Pz)p + PxXpX + p Y YpY + p z ZpZ, 

has the following representation in the above notation: 

[1 - 2(p Y +p z ), 1 - 2(px +Pz), 1 - 2(px +Py)]- 

In fact, any diagonal channel can be realized as a single-bit 
Pauli channel, so the parametrizations of A via [x,y, z] and 
via (px ,Py ,Pz) are equivalent. 

The n dimensional Pauli group is ^„ = {±1, ±z} (g> 9 > ® n . 
Suppose we have a stabilizer code that encodes k qubits into 



n. Its stabilizer S is an abelian subgroup of & n with n — k 
generators The 2 k dimensional codespace is defined as 

C s = € (C 2 )®" so that g\tp) = \ip) for all g e S}. 

The subset of ZP n that commutes with S is the centralizer, 
and it includes encoded operations we can perform on the 
codespace. We measure each generator g,, and let = if 
we project into the +1 eigenspace, and = 1 if we project 
into the —1 eigenspace. We then have an error syndrome j3 E 
F^ k , and we correct with a recovery operator Rp € £? n . 

It was shown in [1] that if C is a stabilizer code, then 
f2 c takes diagonal channels to diagonal channels. In fact, if 
Si, ... , S m are the generators of C, then 

ft C [x,y,z] = [n x (x,y,z),n^(x,y,z),n^(x,y,z)] , 



where 

Q%[x,y,z 



x wx (S k a) yW Y (S k cr) z w z (S k rr) 



— V" fh 

m ' 

k=l 



(6) 



and r)((T, a') = ±1, if aa' = ±a'a, for a, a' e {/, X, Y, Z}. 
Here, w a denotes the cr-weight, a is the encoded a, and 
the Rj denote recovery operators corresponding to the error 
syndromes. For later purposes, we extend r] as the natural 
homomorphism to the negative of the Pauli matrices by 
r}(-a, a') = rj(a, -a') = -r](a, a 1 ) = ij(-a, -a'). 

Therefore, the components of fl c [x,y,z] are polynomials 
of degree n in x, y, z. Observe, however, that in general fl c 
is a map from a higher dimensional space of non-diagonal 
channels to itself. Non-diagonal channels of particular interest 
to us are unital channels; a channel % is unital if (I) = I. 

An important result from [1] is that concatenation of codes 
translates into composition of coding maps. In other words, if 
Ci and Ci are codes and C\ o C2 denotes their concatenation, 
then 



Given a noise model and code C, we are interested in 

what this noise looks like under repeated concatenation of the 
code C with itself. Then the question is, does 

n c ° k (^ 1 '>) -» I, asfc^oo? 

If this is the case, C corrects the error given by ,yV^\ 

Rahn et al. [1] focus mostly on the symmetric depolarizing 
channel given in the above notation by [e -7 *, e -7 *, e -7 *] and 
derive threshold estimates for various codes. We take a global 
point of view, where instead of looking at noise channels point 
by point, we consider the behavior of the coding map as a 
discrete-time dynamical system and study the set of all noise 
channels attracted to the identity channel under iteration of the 
coding map. This approach enables us to use methods from 
the theory of dynamical systems. 
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III. Open set of correctable diagonal errors 

We will first focus on diagonal noise channels, i.e., those 
given by a diagonal matrix, as discussed in the previous 
section. The standing assumption of this section is therefore 
that all noise channels are diagonal. We saw that we can 
characterize the asymptotic properties of the coding scheme 
involving the concatenation of a fixed code C with itself by 
studying the long-term behavior of the dynamical system 

fl c : A -> A. 

We now review some necessary basics from the theory of 
dynamical systems. Good introductory references are [4] and 
[5]. 

A. Dynamical systems preliminaries 

A (discrete-time) dynamical system is a map / : M — * 
M, where M is a space with a certain additional structure 
(topological, metric, differentiable, etc.). In our case, it suffices 
to assume that M is some Euclidean space R k or a subset of 
it, and that / is a differentiable map. We denote by D/(p) the 
derivative of / at a point p and think of it as a linear operator 
on R k . We will denote by ||D/(p)|| the norm of D/(p) as 
such on operator; that is, 

\\T>f(p)\\ = max{||D/(p)t;|| : \\v\\ < 1}. 

(The norm on M fc is arbitrary but fixed.) If D/(p) depends 
differentiably on p, we define the second derivative of / in 
the usual way as D 2 / = D(D/); recall that D 2 /(p) can be 
thought of a bilinear map R k x K fe -> K fe and ||D 2 /(p)|| then 
denotes its norm. Continuing recursively, we say that / is of 
class C r (or simply C r ) if D r /(p) exists and is a continuous 
function of p. 

For p e M, the set {f n {p) : n = 0, 1, 2, . . .}, where f n = 
/ o • • • o / (n times), is called the orbit or trajectory of /. 
A fundamental question in the theory of dynamical systems 
is: what is the long term behavior of trajectories! That is, 
where does f n (p) end up eventually, as n — > oo? The set of 
accumulation points of the orbit of p is called the w-limit set 
of p. An example of such a set is a fixed point of /, i.e., a point 
p such that f(p) = p. A fixed point p is locally attracting if 
there exists a neighborhood V of p in M such that for every 
x e V, f n (x) — > p, as n — > oo. A basic criterion for a fixed 
point to be locally attracting is the following. 

Lemma 3.1: Suppose U C R fc is open, / : U — * K is a 
C 1 map, p e U is a fixed point of /, and A = ||D/(p)|| < 1. 
Then p is locally attracting. 

Proof: Let Ao < A < 1. Since T)f(x) depends continu- 
ously on x and ||D/(p)|| < 1, there exists a neighborhood V 
of p in U such that ||D/(x)|| < A, for all x e V. Then, by 
the Mean Value Theorem, 

\\f(x)-f(p)\\<X\\x-p\\, 

for all x e V. Therefore, 

\\m-p\\ = \\f n (x)-r(p)\\ 

< A n ||a;-p|| 
-0, 



as n — > oo. ■ 

The largest such set V is called the basin of attraction of the 
fixed point p, denoted by 3& '(p). Let B(x,r) denote the open 
ball of radius r centered at x. 

Lemma 3.2: Assume / is C 2 , the hypotheses of the previ- 
ous lemma are satisfied, and ||D 2 /(x)|| < K, for all x e U. 
Then B(p, (1 - X )/K) nf/c #(p). 

Proof: The proof goes along similar lines as the previous 
one. Let A < A < 1 be arbitrary and < r < (A — X )/K. 
For an arbitrary point x in the closed ball B\p, r] n U, we have 

\\Bf(x)\\ < ||D.f(x)-D.f(p)|| + ||D/(p)|| 

< Kr + Xo 

< A, 

that is, / is a contraction on B[x, r] n U. Furthermore, for all 

x e B[p,r] n U, 



\\m-p\\ 



< 



\\m-f(p)\\ 

X\\x-p\\ 



which implies that B[p, r]nU is /-invariant. Therefore, under 
iteration of /, every point in B[p,r] n U converges to p, so 
B[p,r] n U C -SS{p). Taking the union over all A e (A ,l) 
proves the claim. ■ 
Now take / = il c and observe that [1,1,1] is always an 
isolated fixed point of Qp , though not necessarily attracting. 
For instance, [1,1,1] is a saddle for the coding map i7 bf of 
the bit-flip code. However, if C is the Shor or five-bit code, 
then Dfi c [1,1,1] = 0, so [1,1,1] is locally attracting. The 
following result shows that this is not a coincidence. 

Proposition 3.3: Under the assumptions above, if C is a 
quantum error correcting code of distance > 3, then 

Dfi c [1,1,1] = 0. 
Proof: It suffices to show that D£l c sends three linearly 
independent vectors to zero. 

Since the distance of the code is at least three, C corrects 
all errors of weight one. In particular, it corrects all single-bit 
Pauli channel errors 

p — » (1 - e)p + sapa, 

for a E {X,Y,Z} and < e < 1. Such errors correspond 
to noise channels [1, 1 — 2e, 1 — 2s], [1 — 2s, 1,1 — 2s], and 
[1— 2s, l—2s, 1], for a = X, Y, Z, respectively. Let us consider 
a = X. To say that C corrects X-errors means that 

n c [l, \-2s,\- 2s] = [1, 1 - (9(e 2 ), 1 - 0(e 2 )]. 

This implies that the directional derivative 



Dfi c [l,l,l]vjc = — 

as 



n c ([1,1,1] +ev x ) =0, 



where vx = (0, — 1,— 1) T . Similarly, we can show that 
Dfi c [l,l,l]« y = BQ c [l,l,l]v z = 0, where v Y = 
(— 1,0, — 1) T and vz = (— 1,— 1,0) T . Since vx,vy,vz are 
linearly independent, it follows that Dfi c [l, 1, 1] = 0. ■ 
Corollary 3.4: For every code C of distance at least three, 
[1,1,1] is an attracting fixed point of the coding map Q c : 
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A — > A. If S$c denotes its basin of attraction and HD 2 ^! < 
K on A, then 

1 



fi([M,i] lf ]nAc 



(7) 



Proof: Observe that Q° can be extended to the whole 



space R 3 , has [1, 1, 1] as a fixed point, and, by Proposition l3.3l 
A = DO c [l, 1,1] =0. Therefore, [1, 1, 1] is locally attracting 
for fi c as a map R 3 -> tt 3 . By Lemma EH1 1, 1], 1/if) 

is contained in the basin of attraction of [1,1,1], again as a 
fixed point of fl c : R 3 — > R 3 . However, we know that A is 
an invariant set for rt c , i.e., f2 c (A) C A, and it contains 
[1, 1, 1]. Therefore, points in B([l, 1, l],l/K) n A are both 
attracted to [1, 1, 1] and stay in A under iteration of il c . This 
proves 0. ■ 
Proposition 3.5: Suppose G is a CSS code. It will be shown 
in Theorem 15. 71 that 

tl c [x,y,z] = [f(x),g(x,y,z),f(z)], 

for some polynomials /, g. Let a be the largest fixed point of 
/ in (0, 1). Then 

2$c = y> z ) G A : x > a, z > a}. 
Proof: It follows from Proposition 13.31 that 1 is an 
attracting fixed point of /. Let (a, 0) be its basin of attraction. 
It is well known that its boundary {a, 0} is /-invariant. Since 
a £ [a, 1) and [a, 1) is /-invariant, it follows that a is a 
fixed point of /. Therefore, a = a. This means that for every 
x £ (a, 1), f k {x) — * 1, as k — » oo. 

Now suppose [x, y, z] £ A, x > a, z > a. Then 

(n c ) k [x,y,z] = [f k (x),y k J k (z)]. 

We know that f k (x), f (z) — » 1. Let be an accumulation 
point of the sequence (yt)- Since [l,y»,l] £ A, it follows 
that = 1. Therefore, (^ c ) [x, y, z] — > [1, 1, 1], as fc — > oo, 
which implies {[x, y, z] £ A : x > a, z > a} C ^c 1 . 

To show the opposite inclusion, assume the contrary, i.e., 
that there exists a point p = [x, y, z] £ 2$c such that P & 
{[x, y, z] £ A : x > a, z > a}. Then x < a or 2 < a. In the 
former case, f k (x) does not converge to 1, and in the latter, 
/ fe (z) 1, contrary to our assumption that p is in the basin 
of attraction of [1, 1, 1], ■ 

IV. Faulty gates 

We want to extend the analysis in [1] to include faulty gate 
operations both in the error correction and in the computation 
circuits. Gate errors are a common form of noise in quantum 
information processing. We show how to incorporate faulty 
gates into the current framework and how they change the 
effective channel and the coding map. Note that fault tolerance 
for our noise model has been shown, but that there is some 
dispute about the validity of that model and whether quantum 
fault-tolerance is possible [6]. 

A. A simple noise model 

Our first approach is to start with a very simple error model 
for faulty unitary gates G: 

G:p^(l-e)Gp& +e-^I. (8) 



This error model is rather generic. It has the additional 
advantage that noise from sequential gates is additive; if we 
combine two faulty operations as in Eq. (|8}, we obtain 



G 2 o Gi : p 



G 2 [(l-e 1 )G lP G{ + ^I 



{l-e 2 ){l-e 1 )G 2 G l pG\G\ 



(1- 



£i - e 2 )G 2 G 1 p{G 2 G 1 ) 

£2 



N 



-I- 



(9) 



i.e. a faulty process with e = E\ + e 2 . As we have seen, 
the effective dynamics of one level of concatenation is simply 
encoding, noise and decoding, i.e. 

g = voj\fo£. 

Let us also assume here that the noise on the qubits is unital, 
i.e. Af(I) = I. We now show that faulty gates in this model 
have the same effect as noise; hence we can effectively treat 
noise from faulty gates and other types of noise on the qubits 
in the same way. 

The encoding operation can be written concisely as £(p) = 
BpB\ where B = |0)(0| + |1)(1| (or, for codes that encode 
more than one qubit, B = ^7Jj|i)(i|). This encoding is 
performed by applying a sequence of gates, possibly faulty, 
as in Eq. (|8j. The operation corresponding to B can be 
implemented with unitary gates in a larger space by appending 
some ancillary qubits, for instance as Ub ■ | j 0) — ► If 
errors occur according to Eq. (|8j, the resulting operation will 
be 8e B :p->(l- e E )U B pUl + ^-1 = (1 - e E )S(p) + ^ J, 
where £ denotes the error-free encoding and ee is the noise 
accumulated from gates during encoding. In an analogous way 
it can be seen that a decoding map T>, implemented with faulty 
gates, can be written as T> ED : p — > (1— £d)X>(/j)-|-^-I, where 
we have used that T> : -kl — ► 1 1. Putting this together under 
the simplifying assumption that jV(J) = I (unital channels), 
and using additivity of error from faulty gates, we get 



P 



£ 



where e = Ed+£e and Q is the effective channel with perfect 
gates. In other words, faulty gates only contract the iterated 
map by (1 — e). As a result, the coding map Q c (see Eq. 0) 
changes to VLf, the coding map with faulty gates, as 



flf : Af — ► (1 - e)V o M o £ + e-I = (1 - e)n c + e-I. 



;i/=(l- e )n c + e i 

The entries of the matrix Q for the coding map change as 



(1 - e)G aal 



(10) 



where we have used the fact that the coding map whose only 
non-zero entry is Gn represents a mapping of p to the identity 
matrix. In other words, the incorporation of faulty gates into 
our analysis results in an affine mapping of the coding map: 
G is contracted by (1 — e) and the element eSu is added. 
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B. More general noise 

It is not difficult to extend this analysis to more general 
noise in the gates and general noise on the qubits. Let us 
assume that instead of the restricted noise model of Eq. (|8} 
we are dealing with generic noise of rate e. We can write 

G : p — ► (1 - e)G P G f + eN G (p), 

where Nq is some general noise operation. 

The analysis of the previous Section lTV-Al goes through line 
by line. The noise process is additive (with I/N in Eq. (|9} 
replaced by £\G2Nq 1 (p)G\ + £2-^2 (p))- Th e encoding and 
decoding operations can then be written as 

£e E : p^(l-e E )U B pUl + £ -^I 

= (1 - e E )£{p) + e E N E {p) 
V ED : p ^ (1 - e D )V(p) + e D N D (p), 

where N E and No are the noise resulting from encoding resp. 
decoding. Concatenating yields 

P^{l-e)g{p)+eN DE 

with e ~ e E + Ed and the cumulative noise can be written to 
first order as 

eN DE = e E V{N{N E {p))) + e D N D {N{£{p)) 

. The new coding map with faulty gates is then very similar 
to before: 

9P f : N — > (l-e)VoMo£+eN DE (p) = {l-e)n c +eN DE . 

In other words, faulty gates introduce a perturbation to the 
original coding map studied in the previous section. They can 
be treated in the same way as noise on the qubits. In fact we 
see that the occurrence of faulty gates is the same as a process 
with increased noise on the gates and perfect gates. However, 
if the noise on gates is small compared to the noise on qubits, 
we can treat it as a perturbation to the original coding map. We 
will show how to incorporate such perturbations in the analysis 
with the following Lemma. Here, || ^Hc 1 denotes the C 1 norm 
of a smooth map h on its domain, that is, the maximum of 
the suprema of \h\ and ||D/i||. 

Lemma 4.1: Suppose U C I n is an open set, /:[/—> M™ 
is smooth (at least C 2 ), f(p) = p and A = ||D/(p)|| < 1. 
Then for small enough e > and every smooth map g : U — > 
R n , if || g — /He* 1 < £> m en g has a fixed point q such that 
||D ff ( g )||<land|g-p|<e/(l-A). 

In other words, if a map has an attracting fixed point, 
then any sufficiently small C 1 perturbation of it also has an 
attracting fixed point which is close to the original one. 
This is a standard fact from the theory of dynamical systems; 
for completeness, we supply a proof here. Proof: Let 
M be an upper bound of ||D 2 /| on some relatively compact 
neighborhood V of p. Since A < 1, there exists r > such 
that / maps the closed ball B[p, r] into itself and B[p, r] C V. 
Without loss, we can take r so small that r < (1 — X)/M. 
Assume < e < min((l — A)r, 1 — A — Mr). Then it is 
not difficult to show that for every x £ B[p, r], \g{x) — p\ < 
e + Xr < r, which means that g takes B[p,r] into itself. 



Therefore, by the Brouwer fixed point theorem, g has a fixed 
point, say q, in B[p,r}. Since 

\q-p\ = \g(q)-p\ 

< \g(q)-m\ + \f(q)-p\ 

< e + A|?-p|, 

we obtain \q — p\ < e/ (1 — A). 

To show that q is an attracting fixed point for g, let us show 
that ||D.g((7)|| < 1. Observe first that ||D/(q)|| < Mr + A < 
1-e. Therefore, ||D S (g)|| < \\Bg(q) -T>f(q)\\ + \\Bf(q)\\ < 
1. ■ 
It is clear from JlOi that the coding map Q9 of a code with 
faulty gates is a C 1 small perturbation of the coding map £l c 
with perfect gates. 

V. Analysis of Channels 

In this section we will give several technical results about 
channel maps, which we will subsequently use to analyze vari- 
ous diagonal and non-diagonal channels and to give examples. 
In particular we will study in detail how non-diagonal elements 
of a noise channel affect its convergence and threshold. 

A. The two-point theorem 

We look at bounds for a general channel, resulting in Thm. 
1541 

Lemma 5.1: For any non-identity Pauli matrix ct, 

Nl x +Nl Y + Nl z <(l-\N aI \) 2 (11) 
{N XI ±N Xa f + (N YI ±N Ya ) 2 (12) 
+ {N ZI ±N Za f < 1. 

All elements of the channel are real. 

Proof: jV preserves hermiticity, and is positive (sends 
non-negative p to non-negative p) [7]. The first condition 
implies that the elements are real. Then the adjoint channel, 
which has the map jf^p = ^ fc A^pAk, is also positive. A 
simple calculation shows that a matrix p = cjI+cxX+cyY+ 
czZ is non-negative if and only if C] > y/ c 2 x + Cy + c%. 

Let C = V N aX + N *Y + N *Z> and a PP!y 

JK(cI ± (N aX X + N aY Y + N aZ Z)), 

which gives cj = c, and c a = cN a j ± c 2 , so the non-negative 
condition gives lo/Vcr/ic 2 ] < c, from which we get c 2 < 
(1 — \N a j\) 2 , which gives equation ITU 

Let b aa , = N Xa N Xa > + N Ya N Ya , + N Za N Za >. Now let 
c = y/bj} + b aa ± 2b j a . Then, apply JY^ to 

cl - {N XI X + N YI Y + N ZI Z)± (N Xa X + N Ya Y + N Za Z) , 

which gives a = c — bn ± bi a and c a = —bi a ± b aa , so 
c - bu ± b Ia > \-b Ia ±b aa \, which gives c > b u + b a<y ± 
2bj a = c 2 , so c < 1, which gives equation IT2l 

This proof extends naturally to multi-qubit channels. 

■ 

Corollary 5.2: Each row of a quantum channel jV in the 
Pauli basis has norm at most 1. 

Proof: Since |jV CT/ | < 1, we have 1-N 2 j > (l-\N aI \) 2 , 
and so the result follows from Eq. (II It . ■ 
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Corollary 5.3: Let A = N 2 XI + N YI + N' ZI be the non- 
unital portion of the channel. Then we have that any other 
column of the channel in the Pauli basis has L2 norm squared 
N 2 Xa +N 2 a + N 2 Za <l-A. 

Proof: Follows immediately from Eq. H2\ . ■ 

Theorem 5.4 (Two-point theorem): If two of Nxx, Nyy, 
Nzz are 1, then the channel is the identity channel. 

Proof: Let a%, 02, 03 be some permutation of the Pauli 
matrices such that N aiai = N rJ2rJ2 = 1. From Corollary 15.21 
N aiai and N a2a2 are the only non-zero elements in their rows. 
From Corollary 15.31 the non-unital part must be 0, and N airTl 
and N r , 2!J2 are the only non-zero elements in their columns. It 
then follows that the channel is diagonal. From the conditions 
on diagonal channels given in Eq. (0, it easily follows that if 
two terms are equal to 1, the 3 rd term must equal 1, and so 
we have the identity channel. ■ 

B. Example: Generalized Shor codes 

In this section we give give a first application of our 
formalism and the general bounds we obtained. We study 
generalized Shor codes, which are bit flip and phase flip codes 
concatenated with each other. We will assume a diagonal 
channel [x,y, z] in what follows. Note that Thm. l5~4l is easy 
to prove in this case; it follows immediately from Eq. Q- 

b) Bit flip, phase flip: The n qubit bit flip code is a 
classical code on n qubits that corrects all bit flip errors on 
less than ^ qubits and none of the errors on greater than 
j qubits; if n is even it also corrects half of the errors 
on exactly ^ qubits. The coding map is VP* 71 [x : y, z] = 
[x n , h n (x, y, z), ,f n (z)]. To see this note that the code does not 
correct phase flips (Y or Z errors), and so if p = py +Pz, the 
p-component of the coding map must be a function of only 
p. Since x = 1 — 2(py + Pz) = 1 — 2p, it follows that the 
x-component of the coding map must be a function of only 
x. The only such element of the X equivalence class gives us 
x n . 

To see that the z-component depends on z only, note that 
the code can correct bit flips (X or Y errors), sending them to 
/ or Z errors, respectively, and so if p' = px +py, by similar 
reasoning as above we observe that the p' component depends 
only on p' and hence that the z-component is a function of 
only z. Now, assume only X errors. Then z = 1 — 2px, and 
f n (z) = 1 — 2g(i=^), where g(p) is the failure probability as 
a function of an X error rate of p. We can obtain g(p) from 
the properties of the classical bit flip code. 

Since the function h n (x,y,z) does not affect the x and z 
components of the channel, from Thm. 15.41 we may ignore it 
for the purposes of convergence to the identity channel. 

Some values of /„ are 

fx{x)=h{x)= x 
h{x)=f i [x)= \x-\x 3 
h(x)=f 6 (x)= fx - fx 3 + fx 5 . 

For the phase flip code we get similarly Q p ^ n [x, y,z] = 
[f n (x), h' n (x, y, z), z n ] by exchanging the roles of x and z. 

These codes will have two critical values, x c and z c . If 
x > x c then x — > 1, and similarly for z. 



c) Specific codes: We can now obtain sharper results 
for the error threshold of concatenated bit flip and phase flip 
codes, extending [1]. 

The often discussed [[9,1,3]] Shor code has the 
coding map: n shor [x, y, z] = ^iW 3 [z, y, z] 
[fl(x),h"(x, y, z), / 3 (z 3 )]. We define a [[25, 1, 5]] code to be 
n 25 = QP^n b ^, and a [[15, 1, 3]] code to be ft 15 = fi^fi 6 ** 

The [[25,1,5]] code has critical values of x c = 0.916208, 
and z c — 0.645611. The [[15, 1, 3]] code has critical values of 
x c = 0.794438 and z c = 0.850432. If x = z, the [[15, 1,3]] 
code performs much better than the [[25, 1, 5]], even though it 
is less redundant. 

C. Convergence of non-diagonal channels 

In this section we will establish some general results for 
non-diagonal channels in the case of stabilizer codes [8]. 
Non-diagonal channels are in general much harder to analyze 
than their diagonal counterparts, as the parameters span a 12- 
dimensional manifold. However, we will show that in certain 
cases these channels converge to diagonal channels, and will 
discuss when these converge to the identity channel. 

We can decompose the single qubit noise operator JV as 

,yK = L + eM, (13) 

where L is the diagonal part, and e is chosen such that M 
has no term with absolute value more than 1; it contains the 
off-diagonal terms. We show that if e is sufficiently small and 
d > 3, then repeated application of the coding map yields 
a diagonal matrix. This will allow to restrict our analysis to 
diagonal channels, at least in certain regimes. 

We wish to analyze the absolute values of the difference that 
the non-diagonal terms make on the channel after we apply 
the coding map. Define the difference matrix 

r = n c (,yK) -n c {L). 

Let us assume that the code is an [[n, k, d]] stabilizer code [8] 
(it encodes k qubits into n qubits, and has distance d, which 
is the minimal weight of an undetected error). Let m be the 
minimal weight of a non-identity stabilizer element. 

Theorem 5.5: The non-diagonal terms of the difference 
matrix T have absolute value at most CdS d . The diagonal terms 
of r are at most c m e m in absolute value. These coefficients 
are bounded above by 

max(c d , cm) < 2"- k £ \® a , a „ \ < 4"- fc . (14) 

Proof: We can rewrite Eq. (|4) as 

G aa i = T> a N£ a > , (15) 

where S a is the a column of £ and similarly for T>. The 
(non-zero) entries of Sj are the stabilizer elements, and the 
non-zero elements of S a are a times the stabilizer elements, 
where a is the encoded a. We note that § a i a is non-zero only 
if a' and a are in the same equivalence class of C(S) modulo 
S, where S is the stabilizer group, and C(S) is its centralizer 
(see [8] for more detailed definitions). 

Now the non-diagonal elements of T depend on the non- 
zero elements of § a and S a i with a 7^ a', which correspond to 
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the a and a' equivalence classes of C(S), which differ on at 
least d qubits. Then from Eq. resp. Eq. dl5> . it follows that 
the non-diagonal terms involve at least d non-diagonal terms 
of Af and are hence 0(e d ) from Eq. Jl 31 . The difference of 
the diagonal elements corresponds to elements of the same 
S a , which differ on at least m qubits, since m is the minimal 
weight of different elements in the same equivalence class 
(non-zero elements of the same §„). Hence they are 0{e m ). 

From Eq. dl5> it is easy to see that the coefficients Cd and 
Cm are bounded above by 



Y, \®*a»e*»>a>\< 



<j" ,cr f/f 



where we used that each coefficient is at most 1 in absolute 
value and the cardinality of the stabilizer group. ■ 
Note that in certain cases we have explicit expressions for 
Yin-" \^<t'o"\> which can come from calculations with a diag- 
onal noise channel and can give us tighter bounds on Cd and 
c m than the generic 4 n_fc . 

d) Convergence to the identity: Suppose we concatenate 
the above coding map i times. Then the absolute values of the 
off-diagonal terms are bounded above by <2j, where ao = e, 
and a n+ i — c^a^. Then, from Thm. 15.51 

ai — c d e — e { — ) , 

where £o = d y tt: is defined for d > 1. Since these affect the 
diagonal terms by at most c m e m , we can bound the correction 
for the diagonal terms as 



d j 

u i ^n 1 *!-! ^m^d 



= CmS^(-y 

so 



(16) 

Now we assume that the non-diagonal terms go to 0, which 
means that e < Eq, and so ai and bi both go monotonically 
to 0. From Thm.|53] we can see that if the map O c (L®") - 
c m £ rn I converges to within 0(e m ) of the identity matrix, then 
so does Qp (jY® n ). However, we can get a tighter bound than 
this. 

Let Lo = [xo,yo,zo] be the diagonal part of the channel. 
We define Lj = O c (£^_i) — bil. We can think of the Li 
as a lower bound on the diagonal part of the channel. Then, 
the channel goes to [1,1,1], if Li — > [1,1,1]. These coding 
maps are fif(L) = Q c (L i ^ 1 )-b i I, and Of (L) = Q C (L )~ 
c m £ m I- The channel converges to identity if 

...oQ$ ofifL= [1,1,1]. 

D. CSS codes on 1 qubit with a generalized noise channel 

In this section we tighten our result in the case of CSS codes 
[9], [10], [11]. 

Let our code be a [[n, 1, d}] CSS code. From the construction 
of CSS codes from classical codes, n must be odd. Its stabilizer 
group is generated by n — 1 generators, half of which depend 
only on tensor products of Is and Xs, and the other half are 
the same, except they have Zs replacing the Xs. We can write 
the stabilizer group S as the span of {S(X), S(Z)}, where 
S(A) S As, and Ag is the n-dimensional Pauli Matrices 
£P n which only depend on tensor products of / and A. 



The stabilizer elements in S(X) are used to correct against 
Z errors, and the stabilizer elements of S(Z) are used to 
correct against X errors, and so we can write the set of 
recovery operators as R(ex,Z) and R(ez,X), where ea 
are the components of the syndromes obtained by measuring 
stabilizer generators from S(A), and each R(e, A) g As- 
The Pauli operators are encoded as 



X = X® n 6 X s 
Z = Z® n 6 Z s 

Y = ixz = (-i)^y®™ e y s . 



(17) 



To obtain a convenient representation of the decoding operator 
V, we define the average recovery function as 



1 



where the Ri are the recovery operators (see Sec. HJ. Let 
& E ^#2" 2 n be the the diagonal matrix given by 



% a = T](Rav, a) 



(18) 



Where 77 is the linear homomorphism defined in Sec. UTI Eq. 
|6] In particular note that if a commutes with all recovery 
operators Ri, then ,%- a = 1, and if a anti-commutes with all 
of the recovery operators then 2F aa = — 1. Then, from [1] we 
obtain for the decoding matrix 



(19) 



Lemma 5.6: The non-zero elements of 3>x must be con- 
tained in Xs, and similarly for Z, although usually not for 
Y. 

In particular this implies that if a — X or a = Z, then ^ CT(T / 
depends only on N a i, N„x, N a y, and N a z< Then to find 
convergence of the X and Z rows, we can look at these rows 
separately. 

Proof: Since 3lj — I, the non identity stabilizer elements 
must commute with half of the recovery operators. Only the 
non-zero elements of ^ CT don't commute with exactly half 
of the recovery operators. This implies that each non-identity 
element of S(X) commutes with half of the elements of Rz = 
R{e x ,Z), and similarly for S{Z) and R x = R{e z ,X). If 
half of either Rx or Rz commute with some element of S, 
then half of all of the the recovery operators commute with 
it. Now, pick some non-zero element c = X® n sxsz of §x, 
where Si £ S(i). If c ^ Ax then sz ^ I- Then, if an element 
r € Ax, it follows that r/(r,c) — i](r,sz), and so, half of 
Rx commutes with c. Then, c must commute with half of 
the recovery elements, and so must be zero in 'Six- Then the 
non-zero elements of 2>x are in Ax- ■ 
Theorem 5.7: There exists functions fi(a,b,c,d) and 
f2(a,b,c,d) such that the following are is true for £f = 
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&XI = 


M'SYxi, Ixx, i<jYxY, JVxz) 


&xx = 


f 2 (^XX,i^XY, ~*xz, JYxi) 


&XY = 


i n f 2 {i^XY,^xz, Jfxx,Jfxi) 


&XZ = 


hi-^xz, ^xx,i.yYxY, .slxi) 


&ZI = 


£ ( A/ A/ ' A/~ A/ \ 

fi {■yrzi , Jrzx , i^rzY , ^zz) 


&zx = 


hi^zx^^YzY^zz, jVzi) 


&ZY = 


i n .f2(i^ / ZY, ^zz^zx^zi) 


&ZZ = 


hi^ZZ^ZX^ZY^Zl)- 


Furthermore these functions fi(a, b, c, d) and f 2 (a, 



symmetric under permutations of b, c, and d. 

Proof: The permutation X — > iY — > Z — > X, sends 
(ff = 5 to itself, and sends 

— * % n Sy — ► (o^j — ► 

Then, from lemma 1531 and the fact that X «-> Z sends i^x <-> 
^x, /i and /a must exist as stated. 

As for the symmetries, *$xi depends on 2#x and Si. By 
permuting X, iY, and Z, we preserve the stabilizer elements 
which are the non-zero elements of Si, and so <Sxi is fixed 
under permutations of Nxx, iNxy, Nxz- ^xx depends on 
&x, an d §x- By permuting /, Z, and iY, we preserve the 
non-zero elements of Sx, which are X times the elements of 
Sj (see Eq. H7\ ). and so ^xx is fixed under permutations of 
Nxi, iNxY, and Nxz- The other cases follow similarly. ■ 

Lemma 5.8: Let a ^ a' be single qubit Pauli matrices and 
let a" be a non-zero element of S a . Then <j' appears tensored 
an even number of times in a" . 

Proof: In the case where a = I, S a corresponds to the 
stabilizer group. Since S is generated by even weight elements 
in Xs and even weight elements in Zg, in order for it to be 
Abelian, it must have the above property. For general a we 
have Sfj — aS, and, using a is a on all qubits, the desired 
result follows. ■ 

Theorem 5.9: A CSS code takes a channel ,yV 
to the identity channel if and only if both vectors 

[y^xi^xx.-yVxz.i^VxY] and [Jrzi, Jfzz, ^zx, iJVzy] 
converge to [0, 1, 0, 0] under the map 

[a, 6, c, d] — > 

[fi(a, b, c, d), f 2 (b, c, d, a), / 2 (c, d, a, b), if 2 (d, a, b, c)]. 

In fact, it is sufficient that they converge to [*, 1, *, *]. 
Proof: Obviously, this is a necessary condition. From 
Lemma 15.81 we see that each of the variables b, c, and d in 
/i (a, b, c, d), and f 2 (a, b, c, d) must appear an even number of 
times in each term. So we may ignore any —1 sign in front of 
&xy or &ZY- From the symmetries we have, it then follows 
that the above map determines convergence on the X and Z 
rows. The rest of the theorem follows from Thm. 15.41 ■ 
Remark (Unital channels): In the case of unital channels, the 
above reduces to the condition that both [jVxXi ^xz, iAy] 
and \jVzz, ^zx-, i^zy] converge to [1, *, *] under the map 

[a,b,c] -> [f 2 (a, b, ic, 0),f 2 (b,ic, a, 0),if 2 (ic,a, 6,0)]. 



Notice that this no longer depends on fi. 

Lemma 5.10: For CSS codes, we have max(c,j,c m ) < 
2z(n-k) f or Cd an( j Cm as defined in Thm. 15.51 

Proof: We use the bound of Thm. 15.51 for the non- 
diagonal terms. In the case of a CSS code, we have for A = X 
or A = Z that 2>a C As, and so the non-zero entries are given 
by S{A)A® n . Therefore the sum in Eq. l[l4} has only 2^ 
entries, giving an overall coefficient of 22(™~ fc ). ■ 

1) Doubly-even CSS codes: Doubly even CSS codes are 
CSS codes that have weight divisible by 4 for S(X) and S(Z). 
For these codes we can strengthen Thm. 15.91 Define functions 
gi and 172 that are the same as the fi and f 2 defined in Thm. 
15.71 without the factors of i. 

Theorem 5.11: A doubly even CSS code takes a 
channel jV to the identity channel if an only if both 

[^xi,^xx,^xz,^xy} and [J^zi^zz^zx^zy] 
converge to [0, 1, 0, 0] under the map 

[a, b, c, d] 

[gi (a,b,c,d),g 2 (b,c,d,a),g 2 (c, d,a,b),g 2 (d, a, 6, c)] . 

Proof: The stabilizer group is formed by generators £ 
Xs, and generators 6 Zs, each with weight divisible by 4. 
Then X and Z together appear a number of times divisible 
by 4 in each stabilizer element (and similarly for {X, Y}, 
{Y, Z}). Following similar reasoning to that of the proof of 
lemma 15.81 we find that c and t together appear a divisible 
by 4 number of times in each term of fj (j = 1,2), Then, 
fj(a, b, c, id) = fj(a, b, ic, d), and by definition 

gj(a,b,c,d) = fj(a,b,c,id). (20) 

These gj satisfy all the symmetries above and the convergence 
relations of Thm. 15.91 (without the factors of i). ■ 

2) Example: [[7,1,3]] CSS code: We use the example of 
the [[7, 1,3]] code, a doubly even CSS code commonly used 
in fault tolerance calculations, to illustrate how to find the 
functions defined in Thm. 15.71 and use Thm. 15.91 to analyze 
the convergence of channels under this code. 

a) Computation of the coding map: The 
stabilizer group of this code is generated by the 
elements IIIXXXX, IXXIIXX, XIX IX IX and 
IIIZZZZ, IZZIIZZ, ZIZIZIZ. Using the notation from 
section IV-CI the non-zero elements of Si are the stabilizer 
group elements. 

Si = J2 S ={imiii + IIIXXXX) 

(IIIIIH + IXXIIXX)(IIIIIII + XIXIXIX) 
+ {11111 II + IIIZZZZ) 
(II II II I + IZZIIZZ)(IIIIIII + ZIZIZIZ) 

We have X = XXXXXXX, and Z = ZZZZZZZ. 
One notices that there are 7 terms that are some permuta- 
tion of IIIXXXX. Let p 7 (IIIXXXX) denote the sum 
over these permutations. p 7 (IIIYYYY) and p 7 (IIIZZZZ) 
give us the corresponding permutations of IIIYYYY 
and IIIZZZZ. Similarly, there are 42 terms that are 
— IZZXXYY, up to some permutation, so we define a 
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function P4 2 (— IZZXXYY) to sum over these. Then we can 
write 

g 1 = II II III + p A2 {-IZZXXYY) 
+p 7 (IIIXXXX + IIIYYYY + IIIZZZZ). 

With §„ = Sfa we get 

S x = XXXXXXX + p i2 {-XYYIIZZ) 
+p 7 (XXXIIII + XXXZZZZ + XXXYYYY) 
g Y = -YYYYYYY - p i2 {-Y XXZ ZII) 
~p 7 {YYYZZZZ + YYYIIII + YYYXXXX) 
S z = ZZZZZZZ + p i2 {-ZIIYYXX) 
+p 7 (ZZZYYYY + ZZZXXXX + ZZZIIII). 

The recovery operators which depend on X are 

R(e z ,X) = {IIIIIII, XIIIIII, IXIIIII, 

ii xii ii, i ii xii i, mix ii, iiiiiixi, ii ii ii x}. 

Combining these with the recovery operations in R(ex, Z), we 
easily find all 64 recovery operators. There are 1 in the form 
IIIIIII, all 7 permutations of IIIIIIX, all 7 permutations 
of IIIIIIY, all 7 permutations of IIIIIIZ, and all 42 
permutations of 1 1 II IX Z. Eq. dl9l now allows us to find the 
elements of <S a . We calculate & x from g x . XXXXXXX 
commutes with ^ recovery elements, XXXIIII commutes 
with |2 recovery elements, and XXXZZZZ, XXXYYYY, 
and XYYIIZZ each commute with || of the recovery 
elements. Then 



\p 7 (XXXIIII) — XXXXXXX 



\X XX III I 
+ \IXIXIXI 



\IIXXIIX 



i XII 1 1 XX 



-XIIXXII 



-IIXIXX 



4 

1 

+ ~A 
1 

+ A 

+-IXIIXIX 
4 

--XXXXXXX. 
4 



= -fZZZZZZZ 



A similar calculation shows that ^z 
\p 7 {ZZZIIII), but $> Y doesn't follow this pattern. 

Now we wish to compute &xi- First we look at how 
the jp 7 (XXX II II) component of £>x contributes. From 
Ni a = 8 i a , it follows that only elements in 3>i that are 
identity on the last 4 qubits contribute. This is just IIIIIII, 
so we get p 7 (lNxiNxiNxiN II N II N II N II ) = fiVj^. For 
the -^XXXXXXX component of @ x , everything in «?/ 
contributes. This gives a contribution of 

- \{NxiNxiNxiNxiNxiNxiNxi 

+ p 7 {NxiNxiNxiN X xNxxNxxNxx 

+ NxiNxiNxiNxyNxyNxyNxy 

+ NxiNxiNxiNxzNxzNxzNxz) 

+ P42(-NxiNxzNxzNxxNxxNxyN X y))- 



Together, these give 



*xi 



= -N 3 
4 



XI 



XI 



7N XI {N 4 



xx 



N%y 



+N XZ ) - A2NxiN xx N 2 XY N xz ). 



A similar calculation shows that 



%xz 



N xx 



XX 



7N 



xx {N X i 



N XZ 



+N XY ) - 42NxxN% I N% z N% Y ) 



-N XY --(N 7 XY 



7N 3 XY 



(N 



XI 



N 



xx 



-N xz ) 



42N XY N XI N XY N 2 



-N 3 

4 



xz-^N 



xz 



7N 3 XZ 



(X 

2 7\r2 



XI 



N' 



xz) 

4 

XY 



+N XX ) - A2N XZ N XI N XY N- XX ). 

For the functions gj, which are related to fj by Eq. i20\ . we 
obtain 



g(a,b,c,d) := gi(a,b,c,d) 



7 o 
= -a 3 - -a 7 

4 4 



Note that 



- 2 ±a 3 (b i + c A + d A ) + 



g 2 (a,b,c, d) 
63 

T 



-ab 2 c 2 d 2 . 



&XI 
^XX 
%XY 

&xz 



g(N XI ,N xx ,N XY ,N xz ) 
g{Nxx,N X Y,Nxz,Nxi) 
-g(NxY,N xz ,Nxi,Nxx) 
giNxziNxi^Nxx^NxY)- 



b) Analysis: We consider the convergence of a row of 
the channel matrix [a, b, c, d] as in Thm. 15.1 II We have from 
Thm.l5"2lfhat 



+ b 2 + c 2 + d 2 < 1. 



(21) 



If the channel is diagonal (or in general in the case where all 
but one parameter a, b, c or d are zero) we have a critical point 
x c = 0.870807 such that g(±x c , 0, 0, 0) = ±x c . 

Let us now analyze the behavior of non-diagonal channels 
with small off-diagonal elements. 

Theorem 5.12: If any of a, b, c, or d is within x c of 0, it 
must go to 0. 

Proof: This can be proved in general by a rather lengthy 
calculation. To convey the main idea we will here only give 
the proof in the case where one of the 4 variables equals 
(for example, a unital channel). Then our function g becomes 
g(a, b, c) = \a 3 - |a 3 (a 4 + 76 4 + 7c 4 ). We want to show that 
that if < \a\ < x c , we have that \a\ > \g(a,b,c)\. Without 
loss of generality, we may assume that a is positive. Below the 



critical value x r 



we have a > |a 3 - 



3„7 



so we only need to 



see if a < —g(a, b, c), which is maximized by b = \/l — a 2 , 
c = 0. A simple calculation shows that there is no solution. 
Then it follows that \a\ must monotonically go to 0. ■ 
From Thm. 15.121 and Eq. (12 1 i we easily see that the vector 
[a, b, c, d] must converge to a vector with at most one non- 
zero coefficient. Now suppose that a is slightly above x c , and 
that b, c, and d have absolute values of at most some small e. 



10 



We wish to see how much s changes the critical convergence 
value for a. Let k := d9(a ^'°' 0) \ Xc = 1.691859. Then, 



> g(a, 0,0,0) 



g(a,b,c,d) > g(a,e,e,e) 

63 o 4 , . 63 

- e « fc(a — x c ) + x c 



-a 3 e 4 . 



4 v " — 4 

Since b, c, and <i become 0(e d ) = 0(e 3 ) up to 4th order of e, 

63 
4 



the vector converges to [1,0,0,0] for fc(a— a; c )+x c — ^a 3 e 4 > 



x c , which implies that 

63 



63 



63 u.) 
k(a - x c ) > -^-a 3 ^ 4 ~ -^ £4 ( x l + 3 (a - x c ) 2 ) w ^-e 4 ^c 

Solving up to first order for our new critical value, we get 

Qixle 4 



4A- 



x c = 6.14726e 4 + x c . 



This implies that the off-diagonal terms affect the threshold to 
fourth order (as implied by Thm. 15. 5> : but here we improved 
the prefactor c t . Note that Lemma 15 . 1 01 would have given a 
pref actor of 512. 

If we choose a larger number instead of 6.14726, for 
example 7, then our vector converges to [1,0,0,0] from 
[x c + 7e 4 , e, e, e] for e as big as 0.3. 

VI. SVD CANONICAL FORM 

In this section, we follow the method of [3], applying 
unitary gates before and after our channel to create a new 
channel that has fewer parameters. This can be used to improve 
the region of convergence to the identity channel. 

Lemma 6.1: Let <jj be the non-identity elements of the 
Pauli group & '. Then if U — e*'"' 1 , then the unitary channel 
p — > U pU^ performs a rotation by 9 in the a^a^ plane. 
Expressing the unitary gates as channels in the Pauli basis 
creates a bijection from SU(2)/(±I) to 1 © 50(3). 

The Singular Value decomposition (SVD) theorem [12] 
states that if A is a real matrix, then there exists D = 0\AO\ 
such that the Oi are orthogonal, and I? is a diagonal matrix 
with elements \, > 0, which are called the singular values of 
A. Then D = sgn(det A)RlAR[, where Ri e SO(n). 

Theorem 6.2: If jV^ is a channel on one qubit, then there 
exists a channel 



\t' 3 




±Ai 







±A 2 




(22) 







±a 3 ; 

± designates the sign of 



where % e SU(2), and the 

Proof: From Eq. [5] define the vector t = 
(Nxi, Nyi, Nzi), and let A be the 3 x 3 matrix with the 
other 9 variable elements. From the SVD theorem, we have 
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o r\ 



1 
t A 



1 
R x 



1 
t' ±D 



where t' = R\t = (t^t^t^). The outer matrices are unitary 
channels by lemma l6~TI ■ 
Note that II t II = lit' II, so if the channel is unital, t' = 0. 
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A. CSS codes 

We now apply the above to CSS codes, and in particular 
examine the [[7, 1, 3]] CSS code. 

Proposition 6.3: For a given CSS code with a channel 3? in 
the canonical form of Eq.|22] if at least 2 of [t[, Aj] converge 
to [0,1] under the map [a, b] — > [/i(a, b, 0, 0), f 2 (b, a, 0, 0)], 
where the fa are the functions from Thm. 15.71 then by applying 
unitary gates before and after the channel ST, we can create a 
new channel ST' that converges to the identity. 

Proof: Suppose that [i^, Aj] and [t'j, \j] converge to [0, 1] 
under the given map. We define a matrix A £ SO (A) such 
that a i — > ctj, <7j — > ax, <Jj — > fz, and the diagonal matrix 
B = [1, ±1, 1, ±1], By lemma IQ1 these are unitary channels. 
Then, 



J' = A-STA^B = 



and the rest follows from Thm. 15.91 

■ 

Note that even if ST' was diagonal, the order of the could 
affect whether it converges to the identity channel. 

1) Example: The [[7,1,3]] CSS code: For the [[7,1,3]] 
code, the map is h([a, b]) = [g(a, b), g(b, a)], where g(a,b) = 
^(7-3a 4 -216 4 ). Let [a n ,b n ] = h ok ([a,b}). This converges 
to [0, 1] if and only if b n — > 1, 

Now we are interested in which [a,b] converge to [0,1]. 
Using that a 2 + b 2 < 1, a numerical calculation shows that 
there is always convergence to [0, 1] for b > b c w 0.927334. 
For [a, b] — [sin 9, cos 9], this threshold is exact, and so 
these converge to [0,1] for \9\ < 9 C w 0.383572. For a 
unital channel, a — 0, and so this converges to [0, 1] for 
b > x c Ri 0.870807. In either of these cases, we just need 
at most one singular value of the channel to be less than or 
equal to the given critical value. 

We can find an approximate solution for the region of 
convergence to [0, 1] by solving b„ > x c . For n = 1, we 
have an approximation for the region of a < fi(b) = 
| — jb 4 — ^jp-- As n increases, these approximate regions 
rapidly converge to the actual region of convergence to [0, 1]. 

The singular values of a unitary channel are always 1. Note 
that if the unitary channel from lemma loTI is in its original 
non canonical form, it converge to the identity channel for 
|0| < C . 

VII. Conclusion and further questions 

A. Drawbacks of our approach 

The approach of integrating the sequence of concatenated 
encoding and noise as a rather simple map from channels 
to channels is very powerful. By abstracting away from the 
details of the encoding and the noise process, it drastically 
reduces the number of parameters, and makes the coding pro- 
cess amenable to a dynamical systems type analysis. However, 
this approach sometimes comes at a price. By ignoring the 
details of the coding and correction process, we might get 
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error thresholds above the actual thresholds if we accounted 
for all these details. The following example illustrates this, 
introducing the notion of a recovery function. 

Suppose we have a [[n, k, d}] stabilizer code. We define a 
recovery or error correcting function R(e) [13] which maps 
the collection of syndromes measured by the codes to some 
n qubit Pauli operator, R : j^" 2 n -* ~ ¥ We also define a 
syndrome function e : 3P n — > J£" 2 >»-*=, which maps Pauli errors 
to some syndrome. With these definitions we must have that 
(3 = e{R{(3)), for any [3 e Note that we can chose R(J3) 
up to elements of the stabiliser S without any difference for 
error correction. Hence our choices for R(f3) differ from each 
other by elements of the centralizer C(S) are limited to the 
4 fc elements of the Centralizer modulo the Stabilizer. They can 
be written as an element of C(S) times some representative 
element of S. To study the choice of recovery function on the 
channel, define the matrix T a € ^#4^4™ to be the diagonal 
matrix 

Then the matrix operator T, defined in Eq. (1181 . is T = 
Y,i T Ri . We have <$ = & Ri = 0, C (^Y), where the quasi- 
channel (they don't have to preserve trace) 

is the contribution of a single R4 on the channel map. 

When we measure a syndrome e during error-correction, 
we gain some information about the channel. Let the encoded 
state be described by the density matrix p = pjl + pxX + 
pyY + pzZ. We can re-write our channel Sf = Sl c ) as a 
sum over all syndromes 

/3eF 2 „„ fc 

If we measure \j3) and use the information, we collapse 
to a syndrome j3 with probability pp = tr(9f R ^'p) = 

2 & R a Pa, and the resulting density matrix is -^S R ^ p. 

In particular, if Sf^ = S?£ (/3) = <g R z im = 0, then Pj3 = 
2^ R j pi — @ R , which doesn't depend on p, and the 
resulting p-independent channel is then j^^ R ^- If we throw 
this information away we recover the coding map Sf from the 
previous sections. In other words the coding map approach 
corresponds to ignoring the information about the channel that 
we could have obtained from the syndrome measurements, to 
optimize the recovery functions. 

By performing measurements on the subblocks of a con- 
catenated code, we affect the channel on each qubit of the top 
level code. If we don't optimize our error correction, we are 
not being as efficient as we should be. For example, a distance 

3 code can't correct some 2 qubit errors, and so the code we 
obtain by concatenating it once with itself without changing 
the error correction function can't fix some 4 qubit errors. 
However, the distance d of a distance c?i code concatented 
with a distance d 2 code is d > d\d 2 , and so we should be 
able to correct any 4 qubit error. The problem is to keep track 
of all of this syndrome information, and finding the optimal 
error correction function seems to be computationally hard. 



B. Open questions 

We have initiated a dynamical systems approach to quantum 
error correction, extending the result of Rahn et al. [1]. This 
only opens the road to further analysis and many questions 
remain open. We list a few of them here. 

In our analysis we have always assumed that an error 
correction process is successful, if the associated coding map 
takes the noise channel to the identity channel. However, this 
might be too stringent a condition. Are there any other criteria 
for information retrieval, which are not equivalent to zero 
(corrected) error? 

Another question relates to the basin of correctable noise for 
a code: If our noise channel lies outside the basin of attraction 
of a certain code, can we find another code that would "lift" 
this noise into the basin of attraction of the old code? More 
specifically, given a code C (with d > 3) and a noise channel 
p S A—&c, is there another code C such that il c " (p) <E ^c? 
If the answer is positive, then the concatenation scheme G k oC' 
corrects p, as k — > 00. It would be interesting to formalise 
these ideas. 

Yet another question concerns the shape of the region of 
correctable noise. Is there a (non-trivial) bound for the size 
or shape of the domain of attraction? Can we characterize 
regions of noise that are not correctable by any code? There 
is a new and interesting bound on noise from which no circuit 
can recover in [14]. However the methods used there are not 
dynamical. Is it possible to make sharper statements? 
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